Mining Relatednsess Graphs for Data Integration
نویسندگان
چکیده
In this paper, we present the AbsMatcher system for schema matching which uses a graph based approach. The primary contribution of this paper is the development of new types of relationships for generating graph edges and the effectiveness of integrating schemas using those graphs. AbsMatcher creates a graph of related attributes within a schema, mines similarity between attributes in different schemas, and then combines all information using the ABSURDIST graph matching algorithm. The attribute-to-attribute relationships this paper focuses on are semantic in nature and have few requirements for format or structure. These relationships sources provide a baseline which can be improved upon with relationships specific to formats, such as XML or a relational database. Simulations demonstrate how the use of automatically mined graphs of within-schema relationships, when combined with cross-schema pair-wise similarity, can result in matching accuracy not attainable by either source of information on its own.
منابع مشابه
CET: A Tool for Creative Exploration of Graphs
We present a tool for interactive exploration of graphs that integrates advanced graph mining methods in an interactive visualization framework. The tool enables efficient exploration and analysis of complex graph structures. For flexible integration of state-of-the-art graph mining methods, the viewer makes use of the open source data mining platform KNIME. In contrast to existing graph visual...
متن کاملExploration of Kahang porphyry copper deposit using advanced integration of geological, remote sensing, geochemical, and magnetics data
The purpose of mineral exploration is to find ore deposits. The main aim of this work is to use the fuzzy inference system to integrate the exploration layers including the geological, remote sensing, geochemical, and magnetic data. The studied area was the porphyry copper deposit of the Kahang area in the preliminary stage of exploration. Overlaying of rock units and tectonic layers were used ...
متن کاملGraph Pattern Mining for Business Decision Support
To which extent can graph pattern mining enrich business intelligence? This question was the seed whose sprout became my PhD research. To find an answer, I investigated graph-based data integration, the calculation of business measures from graphs and suitable data mining techniques based thereon. The latter should identify correlations between occurrences of specific graph patterns and values ...
متن کاملLinear Time Planarity Testing and Embedding of Strongly Connected Cyclic Level Graphs
Abstract. A level graph is a directed acyclic graph with a level assignment for each node. Such graphs play a prominent role in graph drawing. They express strict dependencies and occur in many areas, e. g., in scheduling problems and program inheritance structures. In this paper we extend level graphs to cyclic level graphs. Such graphs occur as repeating processes in cyclic scheduling, visual...
متن کاملDevelopment of a Data Mining Education Framework for Data Visualization in Distance Learning Environments
With the increasing interest in developing Learning Analytics tools that can be integrated into the well-known Moodle course management systems nowadays, many tools have already been developed. These tools usually requires the user to know data mining techniques, and also requires time to get mining results from the tools. To address this problem, in this article, we present a structure that us...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012